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whenever necessary. Leaf image is chosen for plant recognition since it is 
available and visible all the time. Unlike flowers that are not always available 
Keywords: or roots that are not visible and not easy to obtain, leaf is the most abundant 
type of data available in botanical reference collections. A comparative study 


HOG has been conducted among three popular texture features that are Histogram 
LBP - of Oriented Gradients (HOG), Local Binary Pattern (LBP) and Speeded-Up 
Leaf recognition Robust Features (SURF) with multiclass Support Vector Machine (SVM) 
Multiclass SVM classifier. A new leaf dataset has been constructed from ten different herb 
SURF plants. Experimental results using the new constructed dataset and Flavia, an 
existing dataset, indicate that HOG and LBP produce similar leaf recognition 
performance and they are better than SURF. 
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1, INTRODUCTION 

Malaysia is rich with herbal plants that are not only useful in cooking but also is beneficial in 
medical. This useful knowledge should be let known to the public so that they can gain the benefits. Besides 
that, the access to this knowledge should be easy and fast. Thus, the development of Malaysian medicinal 
herb recognition system is a necessity. Besides Malaysia [1], researches on herbal plant recognition have 
been conducted in other countries such as China [2], Indonesia [3] and Vietnam [4]. 

The most popular part of the plant that has been used for plant recognition is the leaf because the 
leaf is available throughout the year and it is easily visible. Flowers are only available at certain time while 
the roots are not easily accessible. Besides that, leaf 1s the most abundant type of data available in botanical 
reference collections [4]. Professionals who are working together with botany identify plants through leaves 
identification [5]. But manual identification in the field 1s quite troublesome and sometimes time consuming. 
This is because the knowledge from the botanist is required for the plant recognition. Thus, to overcome 
these problems, herbal plant identification through leaf recognition application is critical. 

A robust feature is needed to cleanly distinguish among the different leaves. Leaves have various 
characteristics that can be used for recognition such as shape, color and texture. Shape feature has been used 
in [6] where the leaf’s shape contour, the convexity and concavity properties of the arches are measured. 
Hu invariant moments that represent the shape of the leaf have been utilized in [7]. But the shape can be 
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affected by the age of the plant where younger leaves may look slightly different compared to the older 
leaves and this can produce different results for the same plant. Applying color is very challenging since 
almost all plant leaves have similar colors. 

Texture feature for leaf recognition has been investigated in [8] with very good results. Scale 
Invariant Feature Transform (SIFT) texture features of plant leaves have been utilized in [5] but extracting 
the SIFT feature is a bit slow. Speeded-Up Robust Features (SURF) texture feature has been applied in [9] 
but the performance is not as good as SIFT. Fuzzy Local Binary Pattern (LBP) texture features are being 
used in [3]. Histogram of Oriented Gradients (HOG) has produced good performance in [10]. Even though 
many of the methods produce satisfactory results, there is still room for improvement. Since different 
features can produce different plant recognition results, the identification of a good feature or features is 
critical for plant recognition. Thus, a comparative study is necessary to determine the suitable feature for leaf 
recognition of the Malaysian medicinal plant. This research investigates three popular texture features 
namely SURF, LBP and HOG. A multiclass Support Vector Machine (SVM) has been utilized as the 
classifier of these features for leaf recognition. SVM is chosen because it is widely used for various object 
recognitions such as watermelon seeds exterior quality recognition [11], remote sensing image classification 
[12] and brain wave recognition [13]. 


2. RESEARCH METHOD 

Texture features provides measurements for visual patterns in images. The identification of specific 
textures in an image is achieved by modeling texture as two-dimensional gray-level variations within a 
segmented region. It can be described based on coarseness, contrast, directions, linelikeness, regularity and 
roughness [14]. HOG counts the occurrences of gradient orientation in localized regions of an image [15]. 
Figure | illustrates the HOG algorithm. The implementation of HOG algorithm is described as follows: 
Divide the image into small regions called cells; 

Compute a histogram of gradient directions for the pixels within the cell; 
Discretize each cell into angular bins according to the gradient orientation; 
Group adjacent cells into blocks; 
Normalize the block histogram. 
SURF [16] is achieved by relying on integral images and Hessian matrix-based measure. The 
implementation of SURF is described as follows: 
1. Find image interest points using determinant of Hessian matrix; 
2. Find major interest points in scale space; 
3. Find feature direction; 
4. Generate feature vectors. 

LBP [17] labels the pixels of an image by thresholding the neighborhood of each pixel and considers 
the result as a binary number. Figure 2 illustrates a sample computation of the center pixel with its 
neighboring pixels while Figure 3 demonstrates the general flow of process of LBP. The general 
implementation of LBP is explained below: 

1. Divide the examined window into cells; 

2. For each pixel in the cell, compare the pixel to each of its eight neighbors; 

3. Produce a vector of binary numbers where if the center pixel’s value is greater than the neighbor’s value; 
change it to O and | otherwise. 

4. Compute the histogram of the frequency of each number occurring; 

5. Normalize and concatenate the histogram of all cells. 

SVM was initially developed by for binary classification [18] but multiclass SVM has been 
produced to cater multiclass problems. A comparison has been conducted among four popular approaches for 
multiclass SVM namely one-against-all (OAA), one-against-one (OAQO), decision directed acyclic graph 
(DDAG) and adaptive directed acyclic graph (ADAG) and the experimental results indicate that OAO 
provides the best result [19]. Thus, this research utilizes the OAO approach for multiclass SVM where it 
consists of multiple binary, linear SVM learners. Classification is performed by a max-wins voting strategy 
where every classifier assigns the instance to one of the two classes. After that, the vote for the assigned class 
is increases by one vote and the class with the maximum votes determines the instance classification. 


See 


Leaf Recognition using Texture Features for Herbal Plant Identification (Zaidah Ibrahim) 


154 i) ISSN: 2502-4752 


¥ 4% 


ial _ |. 1. Sample 2. Difference 3. Threshold 


cell histograms 


aann 
4 o} 4 iL ane 4°44 492 + 194 + 1°S + 0°16 + 0°32 + 0°64 + 0°28 = 15 


detection window 4, Multiply by powers of two and sum 


normalized histograms 





Figure 1. Demonstration of HOG algorithm [15] Figure 2. A sample ere of the interest points 
or LBP 


Leaf image The leaf image is LBP histogram Feature 
divided into blocks from each block histogram 





Figure 3. A demonstration of the general flow of LBP 


Figure 4 illustrates the flow of process for leaf recognition. Once the image of the leaf has been 
captured, pre-processing is conducted on the image for resizing and converting the color image into gray- 
scale image. Then, the texture features are extracted and entered into multiclass SVM for leaf recognition. 
The three texture features, namely HOG, SURF and LBP are being extracted separately. 





Figure 4. Flow of process for leaf recognition 


Flavia dataset [20] consists of various orientations of leaf images from thirty three different plant 
specifies. Figure 5 shows some sample images of the leaves that have been used in this experiment where 40 
samples from each species have been used for training and 10 samples for testing. 

Figure 6 shows some sample images from the new dataset that consists of leaves from 10 different 
Malaysian herb plants where 20 samples from each species have been used for training while 5 samples for 
testing. For both datasets, the images were in various orientations and sizes. 
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Figure 5. Sample images of the leaves from Flavia 
dataset [20] 





Figure 6. Sample images of the leaves from new 
constructed dataset 


3. RESULTS AND DISCUSSION 

Table | illustrates the recognition accuracy results of the three texture features with multiclass SVM 
classifier using new constructed dataset while Table 2 shows the results for Flavia dataset. The recognition 
accuracy 1s computed by dividing the correctly recognized leaf of the testing images with the total number of 
tested images. The results of these experiments are constant for both datasets. By looking at these two tables, 
we can clearly see that HOG and LBP produce similar classification results for both datasets and their results 
are much better than SURF. Since HOG and LBP descriptors operate on localized cells, the methods uphold 
invariance to geometric and photometric transformations and robust to monotonic gray-scale changes caused, 
for instance, by illumination variations. As a result, they perform better than SURF. 


Table 1. Results of multiclass SVM for new constructed dataset 


Multiclass Classifier Accuracy (%) 
HOG 99 
LBP 99 
SURF 74 


Table 2. Results of multiclass SVM for Flavia dataset 


Multiclass Classifier Accuracy (%) 
HOG 97 
LBP 97 
SURF 63 


4. CONCLUSION 

This paper evaluated three texture features for leaf recognition, namely HOG, LBP and SURF. One 
of the important characteristic for scene images such as herb plants is tolerant to illumination changes. 
Experimental results on these features extracted from leaf images in a new constructed datasets and Flavia, 
existing datasets conclude that HOG and LBP are better than SURF. SURF is sensitive to rotation and 
illumination changes whereas HOG and LBP are not. Besides that, since HOG and LBP descriptors operate 
on localized cells, the methods uphold invariance to geometric and photometric transformations. Thus, they 
are very suitable for scene images like leaf recognition for herb plant identification. Future work is to 
perform a comparative study on the efficiency performance of these features for mobile application. 
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